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Abstract 

Identifying features of molecular regulatory net- 
works is an important problem in systems biol- 
ogy. It has been shown that the combinatorial 
logic of such networks can be captured in many 
cases by special functions called nested canalyz- 
ing in the context of discrete dynamic network 
models. It was also shown that the dynam- 
ics of networks constructed from such functions 
has very special properties that are consistent 
with what is known about molecular networks, 
and that simplify analysis. It is important to 
know how restrictive this class of functions is, 
for instance for the purpose of network reverse- 
engineering. This paper contains a formula for 
the number of such functions and a comparison 
to the class of all functions. In particular, it 
is shown that, as the number of variables be- 
comes large, the ratio of the number of nested 
canalyzing functions to the number of all func- 
tions converges to zero. This shows that the 
class of nested canalyzing functions is indeed 
very restrictive, indicating that molecular net- 
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works have very special properties. The princi- 
pal tool used for this investigation is a descrip- 
tion of these functions as polynomials and a pa- 
rameterization of the class of all such polynomi- 
als in terms of relations on their coefficients. 



1 Introduction 

A central problem of molecular systems biology 
is to understand the structure and dynamics of 
molecular networks, such as gene regulatory, sig- 
naling, or metabolic networks. Some progress 
has been made in elucidating general design prin- 
ciples of such networks. For instance, in [9] it was 
shown that certain graph theoretic motifs appear 
far more often in the topology of regulatory net- 
work graphs than would be expected at random. 
In [6j [10] it was shown that a certain type of 
Boolean regulatory logic has the kind of dynamic 
properties one would expect from molecular net- 
works. And in [2] it was shown that logical rules 
that appear in published Boolean models of regu- 
latory networks are overwhelmingly of this type. 

These rules, so-called nested canalyzing rules, 
are a special case of canalyzing rules, which are 
reminiscent of Waddington's concept of cana- 



1 



lyzation in gene regulation p3j. Nested can- 
alyzing Boolean rules were shown in [4J to be 
identical with the class of unate cascade func- 
tions, which have been studied extensively in 
computer engineering. They represent exactly 
the class of Boolean functions that result in bi- 
nary decision diagrams of shortest average path 
length [lj. This in itself has interesting impli- 
cations for information processing in molecular 
networks. One consequence of this result is that 
a formula derived earlier for the number of unate 
cascade functions of a given number of variables 
[12J applies to give a formula for the number of 
nested canalyzing Boolean functions, described 
in [3J. A formula for the number of canalyzing 
Boolean functions had been given in [5]. 

Many molecular networks cannot be described 
using the Boolean framework, since more than 
one threshold for a molecular species might be 
required to represent different modes of action. 
There are several frameworks available for mul- 
tistate discrete models, such as so-called logical 
models, Petri nets, and agent-based models. It 
has been shown in [13] and [3] that all these 
model types can be translated into the general 
and mathematically well-founded framework of 
polynomial dynamical systems over a finite num- 
ber system. In |10j the concept of nested cana- 
lyzing logical rule has been generalized to such 
polynomial systems. It has been shown there, 
furthermore, that a large proportion of rules 
in multistate discrete models are indeed nested 
canalyzing, showing that this concept captures 
an important feature of the regulatory logic of 
molecular networks. 

As was pointed out in [5] and [4j , knowing the 
number of nested canalyzing rules for a given 
number of input variables and for a given num- 
ber of possible variable states is important be- 
cause on the one hand it provides an estimate of 



how plausible it is that such rules have evolved 
as regulatory principles and, on the other hand, 
provides an estimate of how restrictive the set 
of rules is. The latter is important, for instance, 
for the reverse-engineering of networks. If the 
set of rules is sufficiently restrictive, then the 
reverse-engineering problem, which is almost al- 
ways underdetermined due to limited data, be- 
comes more tractable when restricted to reverse- 
engineering networks consisting of nested cana- 
lyzing functions. In this paper we present a for- 
mula for the number of nested canalyzing func- 
tions in a given number of variables and show 
that the ratio of nested canalyzing functions and 
all multistate functions converges to zero as the 
number of variables increases. We follow the 
approach in [3J and solve the problem within 
the framework of polynomial dynamical systems, 
which makes it possible to frame it as a problem 
of counting solutions to a system of polynomial 
equations. 

2 Nested Canalyzing Functions 

As mentioned in the previous section, it is pos- 
sible to view most discrete models within the 
framework of dynamical systems over a finite 
number system, or finite field. For our purposes 
we will use the finite fields ¥ p = {0, 1, . . . ,p — 1}, 
p an arbitrary prime number, otherwise known 
as Z/p, the integers modulo p. Furthermore, 
we will assume that ¥ p is totally ordered under 
the canonical order, that is, its elements are ar- 
ranged in linear increasing order, F p = {0 < 1 < 
• • • < p — 1}. Let F = F p for some prime p. 
We first recall the general definition of a nested 
canalyzing function in variables xi,...,x n from 
[10] . The underlying idea is as follows: A rule 
is nested canalyzing, if there exists a variable x 
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such that, if x receives certain inputs, then it by 
itself determines the value of the function. If x 
does not receive these certain inputs, then there 
exists another variable y such that, if y receives 
certain other inputs, then it by itself determines 
the value of the function; and so on, until all 
variables are exhausted. 

Definition 2.1. Let S{ C F, i = 1, . . . , n, be sub- 
sets that satisfy the property that each Si is a 
proper, nonempty subinterval of ¥; that is, ev- 
ery element of¥ that lies between two elements 
of Si in the chosen order is also in Si. Further- 
more, we assume that the complement of each 
Si is also a subinterval, that is, each Si can be 
described by a threshold Si, with all elements of 
Si either larger or smaller than Sj. Let a be a 
permutation on {1, . . . ,n}. 

• The function f : F n — > ¥ is a nested 
canalyzing function in the variable order 
x a ti\ , . . . , x a ( n -\ with canalyzing input sets 
Si , . . . , S n C F and canalyzing output val- 
ues bx, ... , b n , b n+ i G F with b n ^ b n+1 if it 
can be represented in the form 

f(XXj ■ ■ ■ i x n ) = 

h ifx a{ x) e Si 
b2 ifx a (x) i Si,x a{ 2) G S 2 
< : 

b n ifx a (x) £ Si,.. .,x a ( n ) G S n 

. b n +i if x a(l) T Si, ... , X CT (n) T S n 

• The function f :¥ n — > F is a nested canalyz- 
ing function if it is a nested canalyzing func- 
tion in some variable order x c (x), . . . ,x a c n ) 
for some permutation a on {1, ... ,n}. 

It is straightforward to verify that, if p = 2, 
that is F = {0, 1}, then we recover the defini- 
tion in [B] of a Boolean nested canalyzing rule. 



As mentioned above, several important classes of 
multistate discrete models can be represented in 
the form of a dynamical system / : F n — > ¥ n , 
so that the concept of a nested canalyzing rule 
defined in this way has broad applicability. 

3 Polynomial form of nested 
canalyzing functions 

We now use the fact that any function / : ¥ n — > 
¥ can be expressed as a polynomial in n vari- 
ables [H p. 369]. In this section we determine 
the polynomial form of nested canalyzing func- 
tions. That is, we will determine relationships 
among the coefficients of a polynomial that make 
it nested canalyzing. We follow the approach in 
[3]. Let B n be the set of functions from ¥ n to 
F, i.e., B n = {f :¥ n — >¥}. The set B n is en- 
dowed with an addition and multiplication that 
is induced from that of F, which makes it into 
a ring. Let / be the ideal of the ring of poly- 
nomials ¥[xx, ■ ■ ■ ,x n ] generated by the polyno- 
mials {x? — Xi} for all i = l,...,n, where p 
is the number of elements in F. There is an 
isomorphism between B n and the quotient ring 
F[xi, . . . , x n ]/I which is also isomorphic to 

R = I ^h—in x l x 2 ' ' ' X% n f 

Mii,...,in) J 

t=l,...,n 

Now we use this identification to study nested 
canalyzing functions as elements of R. 

Given a subset S of F, we will denote by Qs 
the indicator function of the complement of S, 
i.e., for xq G F, let 

. . / if x Q G S 
Qs{xo) = { 1 if x Q i S 
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We will derive the polynomial form for Qs(x) 
in Lemma l8.2i The following theorem gives the 
polynomial form of a nested canalyzing function. 

Theorem 3.1. Let f be a function in R. 
Then the function f is nested canalyzing in 
the variable order x\, . . . ,x n with canalyzing in- 
put sets Si, . . . ,S n and canalyzing output values 
bi,...,b n , b n+ i with b n / b n+1 , if and only if it 
has the polynomial form 

n-l , n-j ■. 

f(xi,...,X n ) = (bn-j+i - b n -j)Y[ Qs t ( x i) | 

j=0 ^ i=l ' 

(3-1) 

where Qs, is defined as in Lemma \8.Si 

Proof. Let / be a nested canalyzing function as 
in Definition 12.11 and let 

n— 1 f n— j 

g(xi,. ..,x n ) = ^2l On-j+i - b n -j)Y[ Q Sl (xi) 

j=0 i=l 

+&1 

Since g has the right form to be in R, we can 
use the isomorphism between B n and R, to re- 
duce the proof to showing that 

g{ai, ...,a n ) = /(%,. . .,a n ) 

for all (ai,... ,a n ) G F n . 

If a\ G Si, then Qsi(a>i) = 0, therefore 

g(ai, . . . ,a n ) = b\ whenever ai G S\. 

If ai ^ Si and 02 G S 2 , then Qs^iflx) = 1 and 
Qs 2 ( a 2) = 0j therefore 

g(ai, ...,a n ) = (b 2 - bi) + bi = b 2 . 



Iterating this process, if a\ S\,a 2 £ 
S 2 ,...,a n G S n , then QsAai) = 1, <2s 2 (a 2 ) = 
1 3 . . . , and Qs„(a n ) = 0, therefore 

g(ai,. . .,a n ) = (b n -b n -i)-\ h(6 2 -fei)+6i = b n . 

Finally, ii ai £ Si, . . . ,a n £ S n , then Qs 1 (ai) = 
1, Qs 2 («2) = !,•••, and Qs n (a n ) = 1. Therefore, 

ff(ai,. . . ,a n ) = 

(fe n+ i - 6 n ) H h (62 - 61) + fei = fen+l 

This completes the proof. □ 
+61 

Theorem 3.2. Lei f be a function in R. Then 
the function f is nested canalyzing in the vari- 
able order Xo-m , ■ ■ ■ , aw n ) with canalyzing in- 
put sets Si, . . . ,S n and canalyzing output values 
6l, ... , b n ,b n+ i with b n ^ fe n+ i, i/ and on/y z/ ii 
has the polynomial form 

I f{xi,--- ,x n ) = 

> n-l , n-j . 

(fen-i+1 - fen-j)XI f + 6 1 

j=0 ^ i=l ^ 

where Qs a u\ is defined as in Lemma \8.$\ . 

Proof. The proof is very similar to the proof of 
Theorem 13.11 

□ 

4 The algebraic variety of 
nested canalyzing functions 

Here we derive a parametrization for the coef- 
ficients of any nested canalyzing function. We 
will use this parametrization to derive a formula 
to compute the number nested canalyzing func- 
tions for a given number of variables within a 
finite field in the next section. 
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Recall that elements of B n = {/ : F n -> F} Theorem 4.2. Let f <E B n be given by 
can be seen as elements of 



f{x\, . . . , x T: 



R 



EC ■ -r^r* 2 • • • T in 

(h, —,in) 

itSF 
t=l,...,n 



...,«„) 
it€F 
t=l,...,n 



i\...i„^l -^2 



(4.1) 

The polynomial f(x±, . . . ,x n ) is a nested cana- 
lyzing function in the variable order x±,...,x n 
Now, as a vector space over F, R is isomorphic mth canalyzmg mput sets Sl ,...,S n and cana- 
to via the correspondence lyzmg Qutput vahes ^ ^ ^ lf and mly if Us 

EC h i'2 i n v_v ( r 1 \ coefficients satisfy the following equations: 

(ii,...,i n ) 

hev 

t=l,...,n 



n—fi 



We will identify the set of nested canalyzing 
functions in R with a subset V nc f of ¥ p " by 
imposing relations on the coordinates of its el- 
ements. We are going to use the following nota- 
tion: 

Notation 4.1. For r G F and for S C F, 

Cm = C r ... r 



^M\W = ^ r —i— r w ^ ere the value j 



°[0]\{l,..,n-ji} 11 U [P-1] U [J 



m} ii w [p-ir[p-i]\{j} 

(4.2) 

/or ji = 0, . . . , n — 1, where 



<Vi] = (6n+i " b n )(p - l) n Il I Si I . ( 4 -3) 

1=1 



goes in the ith position. 



U [r]\{l,...,j} - ^ii,-,i,-,r...r 



where the values i±, . . . , ij go in the 
1,. . . ,j positions, respectively. 



/or ^ ^ 0,p — 1, and 



1 ( ^ r^ 1 ^ ) C ', 



[p-1], 

(4.4) 



^-iixo} = (p- 1 ) I s l T 1 QsM c \p-^ ( 4 - 5 ) 

for j = l,...,n-l. 



C il ... in _ j o...o, i-e., in-j^Qand 



U [0]\{l,...,n-^} - 



i s = /or all s > n — j. 

S c = ¥\S, i.e., S c denote the complement of S. 



■n—fi 



1 V 



i=l 



J 3=0 



n-j 



i=n— /i+l 



(4.6) 
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°[0]\{ 



i} C b-i]\{i} ~ ^b-i] ( C [o] - &i) (4-7) 

where Qs (0) is defined as in Lemma \8.S\ and 
B n -j = (K-j+i - b n ^j) for j = 1, . . . , n - 1. 

The proof of Theorem 14.21 is given in Ap- 
pendix 2. We now need to provide a similar 
parametrization for functions that are nested 
canalyzing with respect to an arbitrary variable 
ordering. 

Theorem 4.3. Let f E B„ given by 



f (x\ , . . . , x n ) 



Eri. . J-i^h . . . ™in 
'^i 1 ...i n - L i ^2 x n • 

(tl,...,in) 

her 

t=l,...,n 



(4.8) 

The polynomial f(x\,...,x n ) is a nested 
canalyzing function in the variable or- 
der x a(i) > • • • > x a(n) with canalyzing input 
sets Si,...,S n and canalyzing output values 
,b n+1 if and only if, 



Cji...«„_ M — 

U [0]\{ O -(il)... ( 7(i„_ M )} 



n ^Vhm'W} 

(4.9) 



/or jU = 0, . . . , n — 1, where 



Ci 



b-i] 



(6n+l-M(p-l) n IIl 5 <i) 



(4.10) 



i=l 



C [p-i]\Mi)} 



(P 



(4.11) 



/or i, ^ 0,p — 1, and 



°[p-i]\Mi)} 



(p- 1-1 Ov W (o)qp-i], 

/or j = 1, . . . ,n - 1. 



(4.12) 



[0]\{o-(ii)...o-(i n _ M )} 
n— /i 



i=l 

7=0 i=n— u+1 



°[0]\wi)} o b-i]\{"(i)} 



(4.13) 



C[ P -i] I Ci 



(4.14) 



where Qs CT y) (0) is defined as in Lemma \8.S\ and 



B n _j = (b n - j+1 - b n ^j) for j = 1, . . . , n - 1. 



Proof. The proof follows the same line of reason- 
ing used for the proof of Theorem 14.21 □ 



Remark 4.1. Notice from Equation 14.101 that 
a nested canalyzing function f(x\, . . . , x n ) with 
canalyzing input sets Si, . . . ,S n and canalyzing 
output values b%, . . . , b n , b n+ \ is also a nested 
canalyzing function in the same variable order 
with canalyzing input sets S\, . . . ,S% and can- 
alyzing output values b±, . . . , & n +i, b n . In fact, 
Equation 14.101 implies that 
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C [p _ 1] = (b n+1 -b n )(p-l) n ll\S i 



a(i) 



n—1 



(b n -b n+1 )\s a{n) \(p-irn\s c a 



(i) 



i=l 



Note that 



QC 



(mod p). 



Remark 4.2. For every nonzero 6 G F and for 
every nested canalyzing function f(x\, . . . , x n ) 



in the variable order x 



cr(l) ! • • • ) x cr(n) 



with cana- 



lyzing input sets Si, . . . ,S n and canalyzing out- 
put values 61, ... , &n+i, f(x\, . . . , x n ) + b is also 
a nested canalyzing function in the same vari- 
able order and with the same canalyzing in- 
put sets S\ , . . . , S n and canalyzing output values 
bi + 6, ... , b n+ i + b. In fact, for Si,...,S n and 
bi + b, ... , 6 n+ i + b Equations 14.91 - 13.131 stay the 
same and Equation 14. 141 becomes 



o [0]\mi)}°[p-i]\{<t(i)} 



C[p-i] I C\i 



(h + b) 



5 Number of nested canalyzing 
functions 

Here we derive a formula to compute the number 
of nested canalyzing functions in a given number 
of variables n and a given finite field F with p 
elements. 

Let us denote the number of distinct nested 
canalyzing functions in n variables by NCF{n) 
and the number of distinct nested canalyzing 
functions that can be written as a product of 
r nested canalyzing functions by RNCF(n,r) 
(R for reducible) . It is clear from Formula 13.11 
that any nested canalyzing function can be writ- 
ten product of at most n nested canalyz- 
ing functions. Hence RNCF(n, r) = for all 



r > n. We will denote the number of distinct 
nested canalyzing functions in n variables that 
cannot be written as a product of two or more 
nested canalyzing functions by INCF{n) (I for 
irreducible) and the number of distinct nested 
canalyzing functions in n variables that can be 
written product of two or more nested can- 
alyzing functions by RNCF(n). Then 

n 

RNCF(n) = ^RNCF{n,r) 

r=2 

The following lemma relates RNCF(n) and 
INCF(n). 

Lemma 5.1. For each natural number n, we 
have INCF{n) = (p - l)RNCF{n). 

Proof. From Remark 14.21 f° r each / € 
RNCF(n), there are p — 1 functions in 
INCF(n). Conversely, Let / € INCF{n). 
From formula 13. 1\ 



/Ol) 
n-l 



n-l ^ n—3 ^ 

7=0 ^ i=l ' 



3=0 

Let b = p — bi . Hence 
f(xi, ...,x n ) + b = 



n— 1 s 

Yj\ ( b n-J+l 
j=0 L 



n-j 



J n-3 



i=i 



■n—2 , n—j 
j=0 ^ i=2 



+{b 2 - bi) 



Therefore, any element of INCF(n) can be ob- 
tained from an element of RNCF(n). □ 
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The following theorem gives us a formula to 
compute the number of nested canalyzing func- 
tions for a given number of variables n. 

Theorem 5.2. The number of nested canalyzing 
functions in n variables, denoted by NCF{n), is 
given by 

NCF(n) = P RNCF(n), 

where 

RNCF(1) = (p-1) 2 , 
RNCF{2) = 4(p - l) 4 , 
and, for n ^ 3, 

RNCF(n) = 




(p - l) r RNCF(n - r + 1) 



+2 n ~ 1 (p - l) n+1 (2 + n(p - 2)). 

Proof. Prom Remark 14. 2\ in order to calculate 
NCF(1) it is enough to calculate the number 
of nested canalyzing functions with canalyzing 
output values (0, 62), with 62 ^ 0, and then mul- 
tiply by p, because of the isomorphism between 
the sets of vectors {(0,62)} and {(61,62)} with 
62 ^ b\. This isomorphism is given by 

(0,62) (buh + 62) 

for b\ = 1, . . . ,p — 1. For output values of the 
form (0,62), Formula 13.11 gives us 

f(xi) = b 2 Qs 1 (x 1 ) 

There are p—1 choices for 62 and p— 1 choices for 
Qsi(^i)- Note that we do not consider 2(p — 1) 
choices for Qsx(xi) because a nested canalyzing 



function with canalyzing input set Si and cana- 
lyzing output values (0, 62) is also a nested cana- 
lyzing function with canalyzing input set Sf and 
canalyzing output values (62, 0) (see remark liTT]) . 
Therefore 

RNCF(1) = {p-lf, 

Similarly, in order to calculate NCF(2), it is 
enough to calculate the number of nested can- 
alyzing functions with canalyzing output values 
(0, 62, 63), with 63 ^ 62, and then multiply by p, 
because of the isomorphism between the sets of 
vectors {(0,6 2 ,6 3 )} and {(61,62,63)} given by 

(0, 6 2 , 63) +{blMM \ ( bl ,h + 6 2 , 61 + 63) 

for 61 = 1, ... ,p — 1. For output values of the 
form (0,0,63) formula 13.11 gives us 

f{x\) = b 3 Q Sl {xi)Qs 2 {x2) 

There are p— 1 choices for 63 and 2(p — 1) choices 
for each of the Qsi( x i) for i = 1,2. Therefore, 
for (0,0,63) there are A(p — l) 3 functions. 

For (0, 6 2 , 63), where 6 2 / 0, 63 / 0, and 63 / 
62, Formula 13.11 give us 

f(xi) = Q Sl (xi) ((63 - 6 2 )Q 52 (x 2 ) + 6 2 ) 

There are 2{p — 1) choices for Qsi(x\), p—1 
choices for 62, p — 2 choices for 63, and p—1 
choices for Qs 2 { x 2)- Note that we do not con- 
sider 2{p — 1) choices for Qs 2 ( x 2) because a 
nested canalyzing function with canalyzing in- 
put sets Si, S2 and canalyzing output values 
(0, 0, 63) is also a nested canalyzing function with 
canalyzing input sets Si , S% and canalyzing out- 
put values (0, 63, 0) (see Remark [4. ip . Therefore, 
there are 2(p — l) 3 (p — 2) functions. If we count 
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the number of functions after permuting the vari- 
ables, we get 

RNCF{2) = 4(p - l) 3 + H2(p " 1) 3 (P " 2), 

Simplifying the formula above we get 

RNCF{2) = 4(p - l) 4 . 

For n ^ 3 let us compute the number of dis- 
tinct nested canalyzing functions that can be 
written product of n nested canalyzing func- 
tions, RNCF(n,n). From Formula 13. II it is clear 
that a nested canalyzing function can be written 
as a product of n nested canalyzing functions 
if and only if the output values must have the 
form (0, . . . , 0, b n , b n+1 ), where b n+ i / b n . First 
consider the case where b n = 0, i.e. the out- 
put values have the form (0, . . . , 0, b n+ i), with 
6 n +i 7^ 0, for which we have: 

/(#!,••• ,x n ) =b n+1 Q Sl (x 1 ) ...Q Sn (x n ) 

There are p — 1 choices for 6 n +i and 2(p — 1) 
choices for each of the Qsi(%i) for i = 1, . . . , n. 
Therefore, there are 2 n (p—l) n+1 functions. Note 
that if we permute the variables, we will still get 
the same functions. 

For (0, . . . , 0, bnib n +i), where b n +\ 

+ 0, b n ± 
0, and b n+ \ ^ b n , Formula 13. II gives us 

f( x l j ' ' ' ! Xn) = 

QSi (»l) ■ ■ ■ QSn-1 ( ) \ (b n +i — b n )Qs„(x n ) 



There are 2(p — 1) choices for each QSi( x i) f° r 
% = 1, . . . , n — 1, p — I choices for 6 n , p — 2 
choices for b n+ i, and p — 1 choices for Qs„(x n ). 
Note that we do not consider 2(p— 1) choices for 



Qs n ( x n) because a nested canalyzing function 
with canalyzing input sets S± , . . . , S n and can- 
alyzing output values (0, . . . , 0, b n , b n+ i) is also a 
nested canalyzing function with canalyzing input 
set Si, ... , 5 n _i, S n and canalyzing output val- 
ues (0, . . . , 0, b n+ i,b n ) (see Remark l4.ip . There- 
fore, there are 2 n ~ l (p — l) n+1 (p — 2) such func- 
tions. If we count the number of functions after 
permuting the variables, we get 

RNCF(n, n) = 

2 n (p - l) n+1 + Q 2 n - 1 (p - l) n+1 {p - 2) 

= 2 n ' 1 (p - l) n+1 (2 + n{p - 2)). 

Let us now compute the number of distinct 
nested canalyzing functions that can be written 
as a product of n — r nested canalyzing func- 
tions, RNCF(n, n — r) for r = 1, . . . , n — 2. 
From Formula I3.1| it is clear that a nested 
canalyzing function can be written as a prod- 
uct of n — 1 nested canalyzing functions if 
and only if the output values have the form 
(0, . . . , 0, b n - r , ...,b n , b n+ i), where 6 n _ r / and 
b n +i 7^ b n . In this case we have: 

f(x\, • • • , Xn) = 
n—r—1 

n QsM) 

r , n-j . 

(bn-j+1 ~ b n -j) Y[ Qs i {Xi) + b n -r f 
j=0 i=n—r 

There are 2(p — 1) choices for each Qsi(%i) f° r 
i = 1, . . . , n — r — 1 and INCF(r + 1) func- 
tions for the summation part. Note that here we 
do have repetition coming from Remark |4TT] be- 
cause b n - r ^ 0. Therefore, there are 2 n ~ r ~ 1 (p — 
l) n ~ r ~ 1 INCF(r-\-l) such functions. If we count 
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the number of functions after permuting the vari- we have 
ables, we get 



RNCF(n, n — r) 



RNCF(n) 



n 



(5.1) 

Now since +RNCF(n, n), 

RNCF(n)= From Lemma EH 

n Ti—2 

RNCF(n, r) = Y J RNCF(n, n - r), RNCF(n) = 

r=2 r=0 

we have 

n-2 

RNCF(n) = J2RNCF(n,n-r)+RNCF(n,n 



n-l , ^ 

Y.^\) 2M_1 ^ ~ V^INCFin - n + 1) 



n-l 



r=l 



2 " _1 (P " ±YRNCF{n - n + 1) 
+i?iVCF(n,n), 



u=2 



n-2 , x 

E "J 2 n - r - 1 (P - l) n - r - 1 JiVC J F(r + 1) 

r=l ^ ' 



Now, replacing equation 15.11 in the previous for- 
mula, we have where 

RNCF(n) = RNCF(n, n) = 2 n ~ l {p - l) n+1 (2 + n{p - 2)). 

(5.2) 

Finally, again using Lemma 15.11 
NCF(n) = RNCF{n) + INCF{n) = P RCF(n) 
+RNCF(n,n). This completes the proof. □ 

If we make the change of variable \i = n-r, then Examp i e 5.3 (Boolean case). Jarrah et. al. \fl 
Z 1- 1 = n—r — 1 andr + 1 = n — fi+1. Therefore, s how that the class of Boolean nested canalyzing 
RNCF(n) = functions is identical to the class of unate cas- 

cade functions. Sasao and Kinoshita found 
n-l , x a recursive formula for the number of unate cas- 

I I 2 M ~ 1 (p — l)^ 1 INCF{n — pL + Vfiade functions. Therefore, the same formula can 

n=2 V n + / /) e used to compute the number of Boolean nested 

canalyzing functions. Below is the formula origi- 
+RNCF(n, n). nally given by Sasao and Kinoshita which is 

„ , . a particular case of our formula in Theorem[ 

But since, , 

namely when p = 2: 

n \ ( n 



n-{n-l)J ~ \n-ir NCF(n) = 2E(n), 
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Table 1: Number of nested canalyzing functions 



n = 


= 1,...,8 


n 


NCF(n) 


1 


12 


2 


192 


3 


5568 


4 


219648 


5 


10834944 


6 


641335296 


7 


44288360448 


8 


3495313145856 



Table 2: Number of nested canalyzing functions 
for p = 5 and n = 1, . . . , 8 



n 


NCF(n) 


1 


80 


2 


5120 


3 


547840 


4 


78561280 


5 


14082703360 


6 


3029304606720 


7 


760232846295040 


8 


218043057365319680 



where 

E(l) = 2, E{2) = 4, 

and 

71-1 /• \ 

E(n) = J2( n )2 r - 1 E{n-r + l) + 2 n . 

r=2 V r _ / 

Tables [1] - [2] show the number of nested can- 
alyzing functions for p = 3, 5 and n = 1, . . . , 8. 



6 Asymptotic properties of 

NCF{n) 

In this section we examine the asymptotic prop- 
erties of the formula given in Theorem 15.21 i.e. 
we want to know the behavior of NCF{n) as n 
becomes large. 

First we derive the following inequalities: 

Lemma 6.1. For fixed n and r = 2, . . . , n — 1, 

we have 

2 r - 1 {p - l) r RNCF{n - r + 1) < RNC{n) 

Proof. It is a direct consequence of the formula 
given at Theorem 15.21 □ 

Lemma 6.2. For all natural numbers n, we have 

RNCF(n, n) < 2 2n {p- l) n+2 . 
Proof. From equation 15. 2| 

RNCF(n,n) = 2 n ' l {p - l) n+1 (2 + n(p - 2)) 
<j 2 n - l {p- l) n+1 (2 + 2 n (p- 1)) 
<j 2"- 1 (p - l) n+1 (2 n+1 (p - 1)) 
< 2 2n (p-l) n+2 

□ 

Lemma 6.3. For aZZ natural numbers n ^ 3, we 
have 

RNCF(n) < 2 n ^ n - l \p- l) 2n . 
Proof. We prove this by induction over n. First 
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note that for n = 3, 

RNCF(3) = (^J2 l (p-1) 2 RNCF(2) 

+2 2 {p- l) 4 (2 + 3p-6) 
= 24(p - l) 6 + 4(p - l) 4 (3p - 4) 

< 2 5 (p - l) 6 + 4(p - l) 4 (3(p - 1) - 1) 

< 2 5 (p- l) 6 + 2 5 (p- l) 6 

= 2 6 (;p-l) 6 = 2 3 ( 3 " 1 )(p- 1) 2 ( 3 ) . 

Now, assume 

RNCF(n) < 2< n - 1 \p - l) 2n . 

Then 
RNCF(n + 1) = 

2 ( n ^ J) 2r " 1(p " l-YRNCFfa -r + 2) 

r=2 ^ ~ ' 

+RNCF(n + l,n + l) 
= ^^(p - l) 2 RNCF(n) 

r=3 ^ r _ ' 

+RNCF(n+ l,n + 1) 



From Lemma 16. II we have 

^ 1 !) 2r_1 ^ " l Y RNC F(n -r + 2) 

r=3 ^ r _ ' 

= S + X ) " l) r+1 ^CF(n - r + 1) 
J2(p- l)iiiVCF(n) 

r=2 V r / 

Therefore 

RNCF(n + 1) = 

sC f n | M 2(p - l) 2 RNCF(n) 

n—l / , -I \ 

J2(p- l)i?iVCF(n) 

+RNCF(n + l,n + l) 

< 2(p - l) 2 i?iVCF(n) + 

+RNCF(n + l,n + l) 

Using the inductive hypothesis and Lemma 16.21 



we have 

RNCF{n + 1) 



< 2(p - l) 2 2 n ^- l \p - l) 2n £ ( U + r X ) 

r=l ^ ' 



+2 2(n+l)( p _ iyn 



+3 



2 n(n-l)+l^ _ 1 )2(n+l) / n + A 



+2 2 ( n+1 )(p- l) n 



+3 



Now since 



n-l 



Theorem 6.4. The ratio NCF(n) /ip(n) con- 
verges to as n becomes large. 



Proof. 



NCFjn) 
V>(«) 



< 



< 



< 



NCFjn) _ pRNCFjn) 
p(2™(™- 1 )(p~l) 2 ™) 

TTJ 

pP 

2 77.(71 — 1)^277 + 1 
T7T 

TTT 



„77 +77+1 

E — > as n 



oo 



r=l 

we have 

RNCF(n + 1) < 

2 n(n-l)+l(p_ j_\2(n+l)/ 2 n+l) 

+2 2 ( n + 1 )(p _ l)™+3 

< 2 n2+2 (p- l) 2 ( n+1 ) 

+ 2" 2 +™- 1 (p _ l)2(n+l) 

^ 2 ( n+1 ) n (p - l) 2 ^ 1 ). 

Note that n — 1 ^ 2. This completes our proof. 

□ 

Let us denote the number of all possible func- 
tions on n variables by ip(n). The following the- 
orem show that the set of all nested canalyzing 
functions is an increasingly smaller subset of the 
set of all functions. 



because the exponential function p n grows much 
faster than the quadratic function n 2 + n + 1 as 
n becomes large. □ 

7 Discussion 

The concept of a nested canalyzing rule has been 
shown to be a useful approach to elucidating 
design principles for molecular regulatory net- 
works, and such rules appear very frequently in 
published network models. But how much of a 
restriction do such rules impose on the regula- 
tory logic of the network, that is, how "special" 
are such rules? It was shown in [10] that net- 
works with nested canalyzing rules have very 
special dynamic properties. In this paper we 
have shown that nested canalyzing rules do in- 
deed make up a very small subset of all possible 
rules. In particular, we provide an explicit for- 
mula for the number of such rules for a given 
number of variables. 

This was done by translating the problem into 
the mathematical context of polynomial func- 
tions over finite fields and the language of alge- 



13 



braic geometry. The formula we provide uses a 
parametric description of the class of all nested 
canalyzing polynomials as an algebraic variety. 
In particular, this provides a very easy way to 
generate such polynomials through particular 
parameter choices, which is very useful, for in- 
stance, for large-scale simulation studies. 

Another interesting aspect of the formula de- 
rived in this paper is as a future discovery tool. 
The formula for Boolean nested canalyzing func- 
tions in [3] was obtained essentially through 
serendipity. Once the parameterization of this 
class was obtained it was possible to explicitly 
compute the number of solutions of the paramet- 
ric equations for small numbers of variables. The 
resulting integer sequence, giving the number of 
Boolean nested canalyzing rules for small num- 
bers of variables was matched to the number of 
Boolean unate cascade functions, for which a for- 
mula is known. It was shown in [3] that the two 
classes of functions are in fact identical. This 
is of independent interest, since Boolean unate 
cascade functions have been shown to lead to 
binary decision diagrams with smallest average 
path length, suggesting that they are very ef- 
ficient in processing information. The formula 
for multistate nested canalyzing rules provides 
a similar opportunity. While there is no obvious 
match to other function classes for small fields, it 
is worth, in our opinion, to pursue this discovery 
approach further for larger fields. 



8 Appendix 1 

In this appendix we derive the polynomial form 
for the indicator functions Qs- First, for each 
r £ F we denote the indicator function of the 



singleton set {r} by P r , i.e., for xo € F, 

1 if xq = r 



P r (x Q ) 



if xq ^ r 



The polynomial form of P r is given in the follow- 
ing lemma. 

Lemma 8.1. For r £ F, we have 

P r {x) = (p-l)H(x-a), 

aGF 

which has the expanded form 

Pr(x) = 



(p-1) 



x p-l _|_ rx P-2 + r 2 x p-3 _| j_ r P~2 x _|_ J| 



aGF 
a^r 



Proof. Let g{x) = (p — l)TJ(x — a). We want 

aGF 
a^r 

to prove that g{x) = P r {x) for all x € F. Clearly 
g{xo) = if xq 7^ r. It remains to prove that 
g(r) = 1. From the definition of g, it can be 
expanded as 

g(x) = (p — l)(x(x — 1) ■ ■ ■ {x — (r — 1)) 
■■■ (x - (r + !))■■■ (x-(p- I))). 

Then 

g (r) = (p- l)r\{-iy- l ~ r (p - 1 -r)l 
= (p-i)(_i)r(_i)(_2)...(_r) 
{-l) p - l ' r {p - 1 - r)! 

Now, since p — i = —i (mod p) for i = 1, . . . , r, 
we get 

g(r) = {p-l){-iy- l (p-l)(p-2)--- 
(p — r){p — 1 — r)! 
= (p-l)(p-l)! 

= (p — l)(p — 1) (from Wilson's Theorem) 
= 1 
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This proves the first assertion. For the second 
claim, from the previous formula for P r , we see 
that P r is a polynomial of degree p — 1, so P r can 
be written in the form 

P r {x) = {p-l)[xP- 1 + a p ^ 2 x p - 2 



+a p - 3 x p 3 H \-aix + a ], 



where 



"P-3 



E h 

6i,...,6j6F 
bi,...,bj^r 

for j £ {2, . . . ,p — 1}. Then 



1...6 



p— l p— l r 

(-tfE- E 

6i=0 6j-i=0 L 6jeF 



■ b i-i h 3 



1 j ' E • • • E k-&,--i5> 

6l=0 bj_i=0 L 6jSF 
bj-i^r bj^r 



p—1 p— 1 
bi=0 6 3 -_i=0 



6x ... 6j_i(-r) 



£ W...bjM-r) E 

6i=0 6j-_i=0 L 6j_ieF 



p-1 p-1 

(-tf£- E 

b 1= bj_i=0 



bi ... 6j_ 2 (-r)(-r) 



Finally, 



«o=(-i) p - i n a =ri a - 



a€F aeF 

Remark 8.1. Note that p — 1 is even for p > 2 
and 1 = —1 for p = 2. 

This completes the proof. □ 

Finally, the polynomial form of Qs is given in 
the following lemma. 

Lemma 8.2. For ScF, we have 

Q S (X)= E P ^ X ) 

re¥\S 

Proof. Clearly, if xo £ S, then P r (xo) = for all 
r 6 F\S". Therefore 

Qs(* )= E ^o) = 

reF\S 

Similarly, if xq ^ S, then P xo (xo) = 1 and 
P r (a?o) = for all r ^ xq in F\S\ Therefore 

Qs(zo) = E P r0o) = 1- 

r£F p \S 

This completes the proof. 



□ 



9 Appendix 2 

Here we give the proof of Theorem 14.2 



Proof. Let us first assume that the polynomial / 
is a nested canalyzing function with canalyzing 
input sets S±,...,S n and canalyzing output val- 
ues b\, . . . , b n+ \. Then, by Theorem 13.11 / can 
be expanded as 

f(x±, . . . , X n ) = 

n-1 , n-j . 

El ~ b n-j)YlQsA x i) \ + b i 

7=0 ^ i=l ' 
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n-1 



EK-i E 



n— 7 i-i jo 

Ur ■ i/y i X' n > > > „ 



(il,...,i n -j) 
t=l,...,n-j 



+h + ^ \ ( b n-j+l ~ K-j)al . . . Oq J L 
j=0 1 J 

where is defined as in Lemma l&2j i.e., 

QSiiXi) = p r( x i) 
reSf 



for j = 1, . . . , n — 1, and 

U [0]\{l,...,n- M } ~ 



1 n— u n— u+1 n— 7 

a„_i . . . a n ran • • • a, 



El Bn-j",,-i ■ ■ ■ "p-i "o 
j=0 



1 o n_ ^\^/R .„ n -^ +1 n n_ A 

'p-i • • • Q p-i / A n n-j a o ■ ■ ■ a o r 

3=0 1 } 



p-2 x i 



p-2 



Now, from Lemma |8JJ we have that 
C [p -i] 



(b n +l — ^n)ap_l ' ' ' 
n 

(6„ + l-6n)II(P- 1 ) \ S i 



and we have 



£ r 

j=0 



i=l 



1 n— u n— u+1 n— 7 

"m •••"0 



1 n— u 

oz, . . . a- p 



Now, if j ^ n, then 
°b-i]\{i} " 



Ej B n-j a o~ 



. . . Li n 



(bn+i ~ &n)ap_i • • • a> p _ x al a 3 p _ x ... a™ 



n 



i=l 



So 



(6 n+ i - 6n)fjap_ 



i=l 



u b-i]\0> 



II4-i E 

i=l J j=0 ^ 



So 



El D n— u+1 n—j 
) B n-ja p ...a 

L j=o 



n—fi 



ik-i 



i=i 



-i 



°[0]\{l,...,n- M }- 



Now, 



%\ . ..In — fj, 

n — fi r- pu 



3=1 L j=0 
n—fj, 



IK E R x "•' ■■■«;; ' 



H\ (b n +i - b n ) a*_! 



i=l 



°b-i]\{i} 



n—fi 



IL 

j=l 



P-I 



/nP-l,—,p-l 
U [0]\{l,...,n-/4 



1 -1 



= n ( 6 ™+ i ~ °p-i 

i=i L i= i 

n— fj, 

qp-i,...,p-i yj ^i,- 



[0]\{l,...,n-A»} 11 "b-l]\0'} 
j=l 
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n—fi 

n 



Ci 



b-i] 



n—fi 



c^ i v;- p ~ i n c r 



Then 



[0]\{1, ...,n-A*} 11 w b-l]\{i} 



n— u 



O [o]\{i,...,n-M} 11 u [ P -ir[p-i]\m 



•n— -i , 

114 £K 

•i=l J i=O k 



n— n— j 

it o • • • o 



Croi - 6i = Ci 



p— 1 £<0 



[o]\{i}^[p-i]\{i}^b-i]" 



Conversely, suppose Equations [O] - H21 Now ' from Equation EF] we get 
hold for the coefficients of the polynomial 
f(xi, . . . , x n ) in Equation 14.81 We need to show 
that f(xi, . . . j Xfi J is EL nested canalyzing func- Then 
tion. Let a^_ x = (p — 1) | |, 

4 = (p - 1)E rP_1 ^' and a o = ^ (0) for 
j = 1, . . . , n. Then 



C7 



C [0 ] = &i + jN B^-aS . . . a£ 3 > 
j=o 1 J 



= (&n+l - &n) U a p-1: 



and 



i=l 



C [p- i] C b- i]\{i } ~ 4 Hp- i] 1 



(9.1) 



as well as 



U [0]\{1,.. .,»-/,} 



■n— -, u , 
II a P-! E j S n-i«o" 

•1=1 J 7=0 L 



■ • • «o 



Now from Equation 14.2 



n— /Li 



J'=l 



and from Equation 19. II we get 



n—fi 



r>. . _ i7P-l»-,P-l TT 

— u roi\-ri n-ta 11 



in M •>-,,) 1 1 a iHp~i]] 



n—l ( 

E j X! 

"=0 ^(4i,...,4 n _ M ) 

i=l,...,n— /i 

+C [0 ]. 

Therefore 
/(aci, • • • ,x n ) = 



x i • • • x n-j 



n-1 



j=0 



EK* E 4 



f"n. — i J- « 



(il,...,i n -j) 
t=l,...,n-j 
n-1 



^»i...»n-* - u [o]\{i,.. .,n-u} 11 °[p-i]°[p-i]\{i} Finally, 



+&i + E^ 1 ~ b n-j)al 3 \. 



f {x\ , • • • , 3?n) 



n-1 ^ n-j . 

j=0 L 4=1 J 



which is nested canalyzing by Theorem [3TJ This 
completes the proof. □ 
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